NTCIR-5 Query Expansion Experiments using Term Dependence Models
نویسنده
چکیده
This paper reports the results of our experiments performed for the Query Term Expansion Subtask, a subtask of the WEB Task, at the Fifth NTCIR Workshop, and the results of our further experiments. In this paper we mainly investigated: (i) the effectiveness of query formulation by composing or decomposing compound words and phrases of the Japanese language, which is based on a theoretical framework via Markov random fields, but taking into account special features of the Japanese language; and (ii) the effectiveness of the combination of phrase-based query formulation and pseudo-relevance feedback. We showed that pseudo-relevance feedback worked well, particularly when using query formulation with compound words.
منابع مشابه
Chinese Information Retrieval Based on Document Expansion
This paper describes our work at the sixth NTCIR workshop on the subtasks of monolingual information retrieval (CLIR). This is the second time we have participated in NTCIR. We have used query expansion methods in NTCIR-5 with related term groups, and this time we use document expansion. The traditional information retrieval model has limitations on finding related documents since it simply che...
متن کاملNTCIR-5 CLIR Experiments at Oki
We participated in the SLIR, BLIR(PLIR) and MLIR subtasks of the NTCIR-5 CLIR task. Our IR system uses language models for document scoring and query expansion, and can handle four languages; Chinese, Japanese, Korean and English. The system utilizes multiple language resources (bilingual dictionaries, parallel corpora and machine translation systems). We attempted to use some techniques includ...
متن کاملChinese Information Retrieval Based on Related Term Group
This paper describes our work at the fifth NTCIR workshop on the subtasks of monolingual information retrieval (IR). Query expansions using automatically acquired related term groups were explored. Unlike traditional query expansion methods, the related term groups extracted from web-based corpuses and the related terms extracted from document set are used in combination to improve the effectiv...
متن کاملQuery Formulation by Selecting Good Terms
It is difficult for users to formulate appropriate queries for search. In this paper, we propose an approach to query term selection by measuring the effectiveness of a query term in IR systems based on its linguistic and statistical properties in document collections. Two query formulation algorithms are presented for improving IR performance. Experiments on NTCIR-4 and NTCIR-5 ad-hoc IR tasks...
متن کاملOverview of the NTCIR-5 WEB Query Term Expansion Subtask
The query term expansion subtask was conducted to establish an evaluation framework for information retrieval (IR) systems that focus on the effectiveness of query term expansion techniques. However, the quality of query term expansions are affected by several factors (e.g., IR system using expanded query, quality of initial query, etc.), so it is difficult to evaluate this technique. In this s...
متن کامل